A Computational Platform for Development of Morphologic and Phonetic Lexica
نویسندگان
چکیده
Statistic approaches in speech technology, either based on statistical language models, trees, hidden Markov models or neural networks, represent the driving forces for the creation of language resources (LR), e.g. text corpora, pronunciation lexica and speech databases. This paper presents the system architecture for rapid construction of morphologic and phonetic lexica for Slovenian language. The integrated graphic user interface focuses in morphologic and phonetic aspects of the Slovenian language and allows the experts good performance in analysis time.
منابع مشابه
COLDIC, a Lexicographic Platform for LMF compliant lexica
Despite of the importance of lexical resources for a number of NLP applications (Machine Translation, Information Extraction, Event Detection and Tracking, Question Answering, among others), there has been a traditional lack of generic tools for the creation, maintenance and management of computational lexica. The most direct obstacle for the development of such generic tools, that is, independ...
متن کاملLexicon and Corpora for Speech to Speech Translation (LC-STAR)
The objective of the EU-project LC-STAR (Lexica and Corpora for Speech-to-Speech Translation Components) is corpora collection and lexica creation for the purposes of Automatic Speech Recognition (ASR) and Text-to-speech (TTS) that are needed in speech-to-speech translation (SST). During the lifetime of the project (2002-2005) these lexica will be specified, built and validated. Large lexica co...
متن کاملPetra, osiris and molinspiration: A computational bioinformatic platform for experimental in vitro antibacterial activity of annulated uracil derivatives
Annulated pyrano[2,3-d]pyrimidine/pyrano[2,3-d]uracil derivatives were synthesized using aromatic aldehydes, active methylene compounds and barbituric acid in presence of dibutylamine (DBA) catalyst in ethanol as solvent. The different substituents on phenyl ring in the fused pyrano uracil skeleton showed productive influence on its antimicrobial activity against some gram positive and gram neg...
متن کاملSpecifications of Building Polish Lexica for Application in ASR and TTS Systems
This paper brings detailed information concerning the specifications of building Polish lexica of common and special application words for use in speech applications such as ASR (automatic speech recognition) or TTS (text-to-speech) synthesis. The specifications include information on the collection of text corpora and word lists, phonetic, grammatical and morphological annotation, as well as s...
متن کاملAutomatic Phonetic Transcription by Phonological Derivation
Automatic phonetic transcription tools usually perform phonetic transcriptions directly from orthographic representations. Although these approaches often achieve good results, theoretical studies suggest that including morphophonological knowledge allows those systems to improve their performance. Following this idea, we developed a tool which first obtains an underlying representation of each...
متن کامل